A Statistical NLP Approach for Feature and Sentiment Identification from Chinese Reviews
نویسندگان
چکیده
Existing methods for extracting features from Chinese reviews only use simplistic syntactic knowledge, while those for identifying sentiments rely heavily on a semantic dictionary. In this paper, we present a systematic technique for identifying features and sentiments, using both syntactic and statistical analysis. We firstly identify candidate features using a proposed set of common syntactic rules. We then prune irrelevant candidates with topical relevance scores below a cut-off point. We also propose an association analysis method based on likelihood ratio test to infer the polarity of opinion word. The sentiment of a feature is finally adjusted by analyzing the negative modifiers in the local context of the opinion word. Experimental results show that our system performs significantly better than a well-known opinion mining system.
منابع مشابه
Feature-based Sentiment Analysis on Android App Reviews Using SAS® Text Miner and SAS® Sentiment Analysis Studio
Sentiment analysis is a popular technique for summarizing and analyzing consumers’ textual reviews about products and services. There are two major approaches for performing sentiment analysis; statistical model based approaches and Natural Language Processing (NLP) based approaches to create rules. In this study, we first apply text mining to summarize users’ reviews of Android Apps and extrac...
متن کاملA Grouping Hotel Recommender System Based on Deep Learning and Sentiment Analysis
Recommender systems are important tools for users to identify their preferred items and for businesses to improve their products and services. In recent years, the use of online services for selection and reservation of hotels have witnessed a booming growth. Customer’ reviews have replaced the word of mouth marketing, but searching hotels based on user priorities is more time-consuming. This s...
متن کاملExtracting Features for Sentiment Classification: in the Perspective of Statistical Natural Language Processing
In order to improve the accuracy of sentiment classification especially for Chinese online reviews, this paper proposed an approach to extract features for sentiment classification effectively and efficiently. The proposed method is based on supervised machine learning and follows the basic procedure of statistical natural language processing (SNLP). Through feature selection, feature extractio...
متن کاملWEMOTE - Word Embedding based Minority Oversampling Technique for Imbalanced Emotion and Sentiment Classification
Imbalanced training data always puzzles the supervised learning based emotion and sentiment classification. Several existing research showed that data sparseness and small disjuncts are the two major factors affecting the classification. Target to these two problems, this paper presents a word embedding based oversampling method. Firstly, a large-scale text corpus is used to train a continuous ...
متن کاملFeature-based Product Review Summarization Utilizing User Score
With the steadily increasing volume of e-commerce transactions, the amount of userprovided product reviews is increasing on the Web. Because many customers feel that they can purchase product based on the experiences of others that are obtainable through product reviews, the review summarization process has become important. In particular, feature-based product review summarization is needed in...
متن کامل